Formal Definition of QoE Metrics
نویسندگان
چکیده
This technical report formally defines the QoE metrics which are introduced and discussed in the article “QoE Beyond the MOS: An In-Depth Look at QoE via Better Metrics and their Relation to MOS” by Tobias Hoßfeld, Poul E. Heegaard, Martı́n Varela, Sebastian Möller, accepted for publication in the Springer journal ’Quality and User Experience’ [2]. Matlab scripts for computing the QoE metrics for given data sets are available in GitHub [3]. I. DEFINITION OF QOE METRICS A. Preamble We consider studies where users are asked their opinion on the overall quality (QoE) of a specific service. The subjects (the participants in a study that represent users), rate the quality as a quality rating on a quality rating scale. As a result, we obtain an opinion score by interpreting the results on the rating scale numerically. An example is a discrete 5-point scale with the categories 1,’bad’, 2,’poor’, 3,’fair’, 4,’good’, and 5,’excellent’, referred to as an Absolute Category Rating (ACR) scale [6]. Let U be a random variable (RV) that represents the quality ratings, U ∈ Ω, where Ω is the rating scale, which is also the state space of the random variable U . The RV U can be either discrete, with probability mass function fu, or continuous, with probability density function f(u). The notation used in the paper is summarized in Table I. B. Observations in a study are samples of U In a study, each observation can be regarded as samples Ui,r,c of U . Each sample corresponds to the observation i of subject r, under test condition c, using the rating scale Ω. Typically, the notation can be simplified, e.g., if the identity of a subject is not an issue, and we look at only one test with the same test conditions c, then the set of observations will form a set of samples U = {Ui}, i = 1, · · · , R. More details about test conditions are found in the next section. C. Specification of test conditions A study typically specifies a set of test conditions, C, which includes both technical and non-technical factors. The set of test conditions contains an invariant (static) part that is unchanged over all tests, C0, and a variable (dynamic) part that changes between each of the J tests, Cj , (j = 1, · · · , J). For each test, the test conditions might change according to a pattern specified on a set (of size k), Cj = {cj,1, · · · , cj,k}. If Cj is not the same for all R subjects in test j, then the notation must be extended and indexed with, r, i.e. Cj,r. Furthermore, in some studies we don’t have the same number of subjects per test condition. To specify this, you should add an index to the number of subjects Rj of test j. As an example, consider a study of the effect of a specific sequence of changes in the video quality classes, where each subject is expected to rate the overall technical quality of the video. In each test a specific set of quality classes, qn, is defined, let’s say N classes, in an unordered set Q = {q1, · · · , qN}. The variable part of C is then defined as an ordered set Cj = (cj,1, · · · , cj,k), where c contains both the quality class, q, and the time, t, for the quality class changes, c = (t, q). If the sequence of changes is not the same for each subject in a test, either with respect to q or t, then the notation must capture this by extending and indexing the Cj with r as described above. If the test is about the effect of a number of changes, k, and not a specific sequence of such, then Cj = (k,Q). Finally, if each test only has one condition, Table I KEY VARIABLES AND NOTATIONS USED IN THE PAPER. notation meaning MOS Mean Opinion Score SOS Standard deviation of Opinion Score ACR Absolute Category Rating U random variable (RV) for quality ratings U upper limit of a rating scale U− lower limit of a rating scale U set of ratings {Ui} for a particular test condition R number of subjects (= ratings) per test condition J number of tests S Statistical definition set ({Ω, C,Σ,S}) Ω quality rating scale (same as the sample space of the RV U ) C set of test conditions Σ set of statistics S set of (sufficient) observators u quality rating value for a test condition (e.g. MOS on the ACR scale) fu probability that the quality rating is u ∈ Ω f̂u estimate of fu, i.e. ratio of subjects who rate the test condition with u S(u) SOS as a function of mean opinion score u a SOS parameter of SOS hypothesis in Eq. (11) Aθ α-acceptability, i.e. the probability that ratings are above θ, P (U > θ) %PoW Poor-or-Worse (in %) %GoB Good-or-Better (in %) %TME Terminate Early (in %)
منابع مشابه
User-Oriented QoE Metrics and Subjective Assessment for Video Streaming Services on 3G Mobile Internet
To provide customer-satisfied video streaming services over 3G, network services providers turn to measure the video quality in term of quality of experience (QoE) rather than quality of service (QOS). In formal QoE definitions for video streaming services over 3G, QoE is either regarded as the average user opinion from the subjective point of view or combination of some technical parameters fr...
متن کاملImproving Quality of Experience for Multimedia Services by Qos Arbitration on a Qoe Framework
This contribution presents a Quality of Experience (QoE) framework. It evaluates QoE using QoS metrics, network feedback, and dynamic user requirements, and proposes a definition for QoE. A model for experimenting within the QoE framework is presented. It is believed that a better QoE can be achieved when the QoS and its interaction with the network and application layers is considered as a who...
متن کاملCorrelation of Bottleneck Metrics and Quality of Experience
Describing service quality from the end-users' point of view is fulfilled by using Quality of Experience (QoE) metrics. In a networking environment QoE metrics are very close to Quality of Service (QoS), but – because of the subjective nature of QoE – they could not be derived directly from QoS. Collection and analysis of QoS and SLS (Service Level Specification) properties of networking servic...
متن کاملNarrowing the Gap Between QoS Metrics and Web QoE Using Above-the-fold Metrics
Page load time (PLT) is still the most common application Quality of Service (QoS) metric to estimate the Quality of Experience (QoE) of Web users. Yet, recent literature abounds with proposals for alternative metrics (e.g., Above The Fold, SpeedIndex and variants) that aim at better estimating user QoE. The main purpose of this work is thus to thoroughly investigate a mapping between establish...
متن کاملQuality of Experience: From User Perception to Instrumental Metrics (Dagstuhl Seminar 12181)
This report documents the program and the outcomes of Dagstuhl Seminar 12181 “Quality of Experience: From User Perception to Instrumental Metrics”. As follow-up of the Dagstuhl Seminar 09192 “From Quality of Service to Quality of Experience", it focused on the further development of an agreed definition of the term Quality of Experience (QoE) in collaboration with the COST Action IC1003 “Qualin...
متن کاملAn Enhanced Analytical Framework Integrating aspects of Network and Application performance, Psychology, and Content in QoE Assessment for Web browsing
Evaluation of Quality of Experience (QoE) for web browsing is a challenging topic as it involves many different factors related to human as well as technical issues. To understand such a large number of human and technical metrics is both difficult and tricky. Two main objectives of the research presented in this paper are to propose and develop a new QoE assessment approach based on a comprehe...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1607.00321 شماره
صفحات -
تاریخ انتشار 2016